Anticipating the future by watching unlabeled video

Authors

  • Carl Vondrick
  • Hamed Pirsiavash
  • Antonio Torralba
Abstract

In many computer vision applications, machines will need to reason beyond the present and predict the future. This task is challenging because it requires leveraging extensive commonsense knowledge of the world that is difficult to write down. We believe that a promising resource for efficiently obtaining this knowledge is the massive amount of readily available unlabeled video. In this paper, we present a large-scale framework that capitalizes on temporal structure in unlabeled video to learn to anticipate both actions and objects in the future. The key idea behind our approach is that we can train deep networks to predict the visual representation of images in the future. We experimentally validate this idea on two challenging “in the wild” video datasets, and our results suggest that learning with unlabeled videos significantly helps forecast actions and anticipate objects.
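The key idea in the abstract, predicting the representation of a future frame from the current one, can be illustrated with a toy sketch. This is not the authors' method: a small linear map stands in for the deep network, the "frame features" are synthetic 2-D vectors whose future state is a fixed rotation, and every name here (`predict`, `train`, `step`, `pairs`) is hypothetical. The point it shows is that the training target comes from the video's own temporal order, so no labels are needed.

```python
import math
import random

def predict(weights, feat):
    """Apply the linear map: one output value per row of weights."""
    return [sum(w * x for w, x in zip(row, feat)) for row in weights]

def mean_squared_error(weights, pairs):
    """Average squared distance between predicted and actual future features."""
    total = 0.0
    for now, future in pairs:
        pred = predict(weights, now)
        total += sum((p - f) ** 2 for p, f in zip(pred, future))
    return total / len(pairs)

def train(pairs, dim, lr=0.05, epochs=200):
    """Fit the map with per-sample gradient steps on the squared error."""
    weights = [[0.0] * dim for _ in range(dim)]
    for _ in range(epochs):
        for now, future in pairs:
            pred = predict(weights, now)
            for i in range(dim):
                err = pred[i] - future[i]
                for j in range(dim):
                    weights[i][j] -= lr * err * now[j]
    return weights

# Assumption: synthetic stand-in for features of a frame and of a later frame.
ANGLE = 0.3

def step(feat):
    """Hypothetical 'world dynamics': the future feature is a rotated copy."""
    x, y = feat
    return [math.cos(ANGLE) * x - math.sin(ANGLE) * y,
            math.sin(ANGLE) * x + math.cos(ANGLE) * y]

rng = random.Random(0)
pairs = []
for _ in range(50):
    now = [rng.uniform(-1, 1), rng.uniform(-1, 1)]
    pairs.append((now, step(now)))  # the target is simply the later frame

weights = train(pairs, dim=2)
identity = [[1.0, 0.0], [0.0, 1.0]]
learned_error = mean_squared_error(weights, pairs)
copy_error = mean_squared_error(identity, pairs)  # baseline: assume no change
```

Comparing `learned_error` with `copy_error` contrasts the learned predictor against the naive assumption that the future looks like the present. In a real system the features would come from a visual network, and a classifier applied to the predicted features would produce the anticipated action or object.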

Similar resources

When will you do what? - Anticipating Temporal Occurrences of Activities

Analyzing human actions in videos has gained increased attention recently. While most works focus on classifying and labeling observed video frames or anticipating the very recent future, making long-term predictions over more than just a few seconds is a task with many practical applications that has not yet been addressed. In this paper, we propose two methods to predict a considerably large ...

The Role of Avatar in Interactive Fictional World of Video Games

In third-person video games, players are able to move and progress in the interactive world of the game while watching their avatar from an external point of view. The purpose of this paper is to investigate the role of avatar in the interactive imaginary world of video games using double vision theory. This article is based on descriptive-analytical methods and the use of library data and imag...

Using video images of dementia in advance care planning.

BACKGROUND: Advance care planning is a process by which patients plan for future medical care under circumstances of impaired decision-making. Central to this process is the patient's understanding and ability to imagine future health states. METHODS: A before-and-after oral survey was used to compare the effect of a video depiction with that of a verbal description of a patient with advanced d...

Effect of Video Education on Reduction of Post ETC Complications

Introduction: ECT is an effective but little-known treatment for psychiatric diseases, of which patients and their families have an irrational fear. Fear of brain injury due to ECT is always with the patients. This study investigates the effect of video education on reducing ECT complications. Methods: In this blind study, the patients were given the necessary education about ECT thr...

Learning to Separate Object Sounds by Watching Unlabeled Video

Perceiving a scene most fully requires all the senses. Yet modeling how objects look and sound is challenging: most natural scenes and events contain multiple objects, and the audio track mixes all the sound sources together. We propose to learn audio-visual object models from unlabeled video, then exploit the visual context to perform audio source separation in novel videos. Our approach relie...

Journal title:
  • CoRR

Volume abs/1504.08023

Publication date 2015